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ABSTRACT 

We find that nonlinearities in the dark-matter power spectrum are dramatically smaller if the density 
field first undergoes a logarithmic mapping. In the Millennium simulation, this procedure gives a 
power spectrum with a shape hardly departing from the linear power spectrum for fc < 1 /iMpc""'^ at 
all redshifts. Also, this procedure unveils pristine Fisher information on a range of scales reaching 
a factor of 2-3 smaller than in the standard power spectrum, yielding 10 times more cumulative 
signal-to-noise aX z — Q. 

Subject headings: cosmology: theory — large-scale structure of universe — methods: statistical 



1. INTRODUCTION 



The clustering of galaxies on extragalactic scales is 
a fundamental cosmological observable. But most cos- 
mological analysis of galaxy power spectra has been 
confined to the largest , linear scales, with wavenumber 



2004 



Tegmark et al. 



k < .2/tMpc~^ (e.g., [Pope et al. 

2006 1 . This is because various effects obscure cosmologi 
cal information on nonlinear scales: scale-dependent bias 
between the observable galaxies and the theoretically 
more straightforward dark-matter distribution; redshift 
distortions from peculiar velocities; and the fact that cur- 
rently, N-body simulations are necessary to model the 
matter power spectrum accurately on nonlinear scales. 

In principle, these effects can be modeled, but re- 
cently, another effect has been investigated that further 
discourages prospectors looking on translinear scales, 
0.2ft,Mpc^^ ^ k < 0.8ft-Mpc~^, for cosmological infor- 
mation. Translinear scales are not fully linear, but they 
are larger than the characteristic scales of halos. Naively 
then, one expects the nonlinearities there to be weak, 
but the power spectrum variance and covariance are sur- 
prisingly large (Meiksin & White 19991 IScoccimarro et al. 



1999 Cooray fc Hu|200T I . I'his inakes the gain in cosmo 



logical parameter Fisher information modest if an anal- 
ysis of the large-scale po wer spectrum is extended to in- 
clude translinear scal es ( Rimes fc Hamilton 2005, 2006' 
Neyrinck et~ari|2006l [Neyrinck fc Szapudi,,2007; Lee fc, 
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2008 Takahashi et al 



model (e.g., Cooray fc Shet 



2009F 



At least in a halo 
of large-scale struc- 
ture, this effect comes from cosmic variance in the halo 
population: a chance preponderance of large halos in a 
survey increases the power spectrum dispr oportionately 
on translinear scales ( Neyrinck et al.|[2006 l. 
Figure [l] shows a s 



ice of the Millennium Simulation 



density field (MS; |Springel et al.|2005| . The bottom pan- 
els show log(l -|- d), as well as a Gaussianized (Joaussi in 
which the ranking of cell densities is preserved, but they 
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Fig. 1. — Dark matter density in a slice 2h ^ Mpc thick and 
250^-1 Mpc on a side in the Millennium simulation. 5 is the over- 
density at z = 0. (5init is at 2 = 127, scaled to have the same 
minimum as log(l -|- S), the natural logarithm of the density at 
z = 0. The Gaussianized density ^Gaussi at 2 = 0, preserves 
the ranking of cell densities but imposes a Gaussian PDF with 
Var((5Gauss) = Var[log(l + 5)]. Particular structures at 2 = and 
2 = 127 disagree because of drift as the simulation progresses. 
Sec http : //skysrv .pha. jhu. edu/" neyrinck/sonif ylss .html for 
sounds representing these panels. 'I'he similarity in the sounds (ex- 
cept <5's) suggests a similarity in their power spectra. 



are mapped to a Gaussian probability density function 
(PDF), ('log' denotes the natural logarithm.) The bot- 
tom panels show more structure than the 5 panel, which 
is why simulation visualizations often use logarithmic 
color tables. Figure [2] shows the high non-Gaussianity 
of the 6 PDF compared to the log(l + S) PDF. 

All that is visible in the S panel is a handful of discrete 
peaks (halos). This suggests why the standard power 
spectrum, Ps, is so sensitive to fiuctuations in the halo 
population, and also suggests why the halo model sue- 
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Fig. 2. — PDF's of S and log(l + S) measured in 2- h"^ Mpc cells 
in the Millennium simulation at z = 0. Each distribution has been 
calibrated to have standard deviation 1. 

cessfuUy describes Ps- The other panels contain obvious 
filamentary structure, which we suspect the halo model 
would have more difficulty describing. 

There are several reasons to use the log transform, 
despite the slig ht non-lognormality of the density PDF 
( Colombi 1994 1 . The lognormal distribution is simp le; its 
use in gala xy number density PDF's dates back to |Hub-| 
ble (19341. The log transform is easily reversible, and 
thus p reserves pixel- by-pixel information. [Coles fc Jones 
( 1991 1 showed that a lognormal density PDF emerges if 
peculiar velocities are assumed to grow according to lin- 
ear theory, and showed that a lognormal density PDF 
can explain many features observed in our U niverse. 
In Schrodi nger perturbation theory (SPT; Szapudi & 
2003 1, ^ = log(l -I- d)/2 is a natural density vari- 
able. In determining the variance of S, tree-level SPT 
in A captures most of the higher-order loop corrections 
in standard perturbation theory. This suggests that the 
power spectrum of the log-mapped density could pull in 
information from higher-order statistics of 6. 

Another motivation for using the log transform is to 
make the power spectrum more suited to describe the 
density field. The power spectrum contains all the cos- 
mological information in the Gaussian initial conditions; 
all higher moments are zero. The density field is sta- 
tistically invariant under translations and rotations at 
all epochs. Initially, there is also a symmetry between 
underdense and overdense regions, related to the initial 
Gaussianity of the density PDF. At late times, this sym- 
metry is broken, and high overdensities receive perhaps 
undue weight in measurement of the power spectrum. 
In this paper, we test the hypothesis that restoring the 
Gaussianity of the density PDF restores some informa- 
tion to the power spectrum. 

2. POWER SPECTRUM NONLINEARITIES 

We measure nonlinearities in power spectra of the 
log-density (-Piog(i+5)) and the Gaussianized overdensity 
(J^Gauss) using the dark-matter density field from the 500 
h~^Mpc ACDM Millennium simulation (MS). We use 
publicly available time snapshots of the density measured 
with a nearest-grid-point (NGP) method on a 256'^ grid. 
On this mesh, the shot noise is negligible; there is a mean 
of 600 and minimum of 6 particles per grid cell. 

Figure [3] compares the distortion gravitational evolu- 
tion imparts to various power spectra in the MS. We 
show the ratio of each power spectrum to the initial- 
conditions (z = 127) power spectrum, dividing by fac- 
tors to line up the power spectra in the lowest-fc bin. 
Also shown are the fluctuations of the initial power spec- 
trum with the same binning, dividin g by t he no- wiggle 
power spectrum of |Eisenstein fc Hu| (1998). We do not 
correct for the NGP pixel window function; for example. 
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jauss 

Millennium simulation 



3. — The top 3 panels show the ratios of Ps, Piog(i+5), ^.nd 
to the initial power spectrum at various redshifts of the 
We apply a multiplicative factor to each 
curve to line up the power spectra in the lowest-fc bin. The bottom 
panel shows the level of fluctuation in the initial power spectrum 
using the same bins, relative to a no-wiggle power spectrum. 
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Fig. 4. — The large-scale biases fiog{i+i)/^'i ^'Hd Poauss/ Pg 
in the Millennium simulation sampled on a 256^ grid at various 
redshifts, measured using the lowest-fe bin. Also shown are a few 
analvtic approximations, as given in the Appendix. The result in 
Eq. 3lk traces each bias almost perfectly through (S'^^^i) ~ 7. 

this causes the small-scale downturn in the bottom panel. 
We cut the plot slightly before the Nyquist wavenumber 
(fc = 1.6/iMpc~^), but effects from the NGP assign- 
ment may persist at the smallest scales plotted. However, 
these effects should be mild for the ratios of two similar 
power spectra, i.e. in the Piog(i+5) and Pcauss panels. 

Both Piog(i+5) and Pcauss experience minimal nonlin- 
earities compared to the standard power spectrum Ps, 
to the extent that we conjecture that they trace the lin- 
ear power spectrum itself well into the nonlinear regime. 
Possibly, a different transformation than a logarithm 
could give a better estimator of the linear power spec- 
trum, but the similarity of -Piog(i+5) and Pcauss to each 
other suggests that Piog(i+5) is close to optimal already. 

As structure develops, a multiplicative bias grows be- 
tween -Piog(i+5) and Ps on large scales, shown in Figure|4] 
for several redshifts of the MS. This bias depends on the 
variance of cell densities, and thus on the cell size used 
to sample the density field. 

In the Appendix, we estimate the large-scale bias be- 
tween P\og{i+s) and Ps analytically, both perturbatively 
(assuming small fluctuations) and non-perturbatively 
(assuming a lognormal density PDF). Figure |4] shows 
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the performance of these bias approximations. The non- 
perturbative result, 



lim 



+5){k) 



-Var[log(l+5ocll)] 



works almost perfectly through (^^^n) ~ 7. For the MS 
at this cell size, this occurs at z = 1.2. At lower z, pre- 
sumably, the density becomes insufficiently lognormal for 
the calculation to work perfectly. The perturbative re- 
sults, given in the Appendix, work for <C 1, as 



expected. W e use perturbation theory (e.g., Bernardeau 



et al.||2002|) on a linear power spectrum from CAMB 



( [Lewis et ar.|2000 1 to calculate the cumulants in the Ap- 



pendix. i'he radius of the top-hat filter we used is that of 
a sphere with the volume of a grid cell. The bias is almost 
identical for Pcauss, if Var(5Gauss) is set to Var[log(l-|-5)]. 

3. INFORMATION CONTENT 

We compare the information c ontents of P\o^(i+s) c^^d 
Ps using the signal-to-noisc (S/N;'Takahashi et al.'2009'), 
the Fisher information (Fisher 1935; Tcgmark et aL 1997) 
about the mean of a power spectrum itself. The y/N m 
a power spectrum over a range of bins TZ is defined as 
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C-ji is the covariance matrix_of the_ p^wer spectrum in 
bins, C,j = {{P, - Pi){Pj - Pj)) /{PiPj), not to be con- 
fused with the cumulants in the Appendix. Fisher infor- 
mation usually references a set of cosmological parame- 
ters, but an analysis of how Piog(i+5) varies with cosmo- 
logical parameters is beyond the scope of this paper. 

To measure the covariance matrices, we subject the 
MS to 248 different large-wavelength sinusoidal weight- 
ings tha t estimate the covarian ce in a single cubic sim- 
ulation (Hamilton et al. 2006|). This includes higher- 



order weightings than the 52 they recommend, reducing 
the noise somewhat, but increasing the minimum usable 
k € TZ. For Pi, we use the power spectrum of the un- 
weighted density. For Piog(i+5), we apply the weight- 
ings to log(l + (5) — (log(l + S)), subtracting off the mean 
because the weightings can cause a spike in the power 
spectrum if the mean is nonzero. 

Figure |5] shows S/N for various power spectra. The 
smallest k & TZ is fixed at the smallest k not directly 
affected by weightings, and we vary the maximum fcmax- 
On linear scales, S/N c>c k"^^^, proportional to the num- 
ber of modes. We plot the S/N to the Nyquist wavenum- 
ber, because results for a downgraded 128'^ grid were 
nearly identical to the 256'^ results up to the 128^ Nyquist 
wavenumber. Still, features at /cmax ~ 1 hMpc~^ should 
be interpreted cautiously. For P^, global p, the mean den- 
sity p used in S = p/p — 1 is the global mean of the 
simulation, while for Pg, the 'local' mean density is used, 
i.e. (p(r)w(r)^), where w{r) is one of the 248 weighting 
functions. Using the local mean actually boosts the S/N 
somewhat in Ps, because generally, denser regions have 
higher translinear power. 

The S/N in -Piog(i+5) continues to grow with fcj^^x 
an extra factor of 2-3 beyond where the S/N in Pg turns 
over, giving a factor of ~ 10 S/N increase on small scales. 
The S/N in Pcauss reaches a factor of 2 even higher. 
The global vs. local mean-density issue does not apply 
to Pcauss and PiogCi+s)- 
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Fig. 5. — A comparison of the signal-to-noise for different power 
spectra at z = 0, 1.2 and 127. Solid, dashed, and dotted curves 



show S/N in Ps, 



log(l+5) 



and Pcauss ■ The bold black curves use 



a degraded 128^ (from 256^) grid. At left, S/N are at 2 = 0; the 
right panel also shows S/N at 2 = 1.2. The Pg global p curve shows 
S/N for Pg if the global instead of local mean density is used to 
estimate 5 for each weighting. 

4. CONCLUSION 

We find that the power spectrum of the log of the mat- 
ter density, P\og{i+5), and the power spectrum of 6 af- 
ter Gaussianizing its PDF, Pcauss, suffer dramatically 
smaller nonlinearities than the standard matter power 
spectrum Pg on translinear scales at all redshifts tested. 
This is true for both the mean of the power spectrum and 
its covariance, as we measure from the high-resolution 
Millennium simulation (MS). Not only do Piog(i+5) and 
Pcauss seem to trace the linear power spectrum to nearly 
k — 1/iMpc^^, but they respectively contain factors of 
10 and 20 higher signal-to-noise than at z = 0. 

To constrain cosmology, it is necessary to study how 
Piog(i+5) varies with cosmological parameters. Much cur- 
rent large-scale-structure research is focused on baryon 
acoustic oscillations (BAO's), so a major issue to investi- 
gate is the degree of BAO attenuation in Piog(i+5). The 
MS by itself is too small to provide an adequate answer to 
this question. BAO detection would likely benefit from 
the lack of nonlinearities in Piog(i+5) on BAO scales, al- 
though the BAO's in Piog(i+5) might experience some 
smearing as in Pg. These results are also encouraging 
for non-BAO large-scale-structure cosmology, since they 
imply a significantly larger range of scales over which the 
linear power spectrum can be accessed. 

It is also necessary to investigate the effects of red- 
shift space distortions, shot noise, and galaxy bias on 
Piog(i+5). Shot noise could be a significant issue in realis- 
tic surveys because accurately constructing the log(H-^) 
and ^Gauss fields requires knowledge of the density even 



in voids. It could be that p 



log(l-|-5) 



has low sensitivity 



to the accuracy of densities estimated in voids, but it is 
likely th at adaptive density kernels or tes sellation meth- 
ods (e.g., van de Weygaert & Schaap 2009 1 will be needed 
to achieve substantial information gains with Piog(i+5) 
over Pg. The issue of galaxy bias could actually be a 
positive feature of Piog(i+5) 
at least in the limit of large S « 6+1, Pi 



even on linear scales, since 



log(l+5) 



IS insensi- 



tive to linear galaxy bias. Although Pcauss outperforms 
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Piog{i+s) in our idealized simulation, it remains to be 
seen what the optimal transform is in the face of shot 
noise and other observational issues. 

It is reassuring to find that at least in principle, much 
information that gravity seemed to have unfairly stripped 
from the matter power spectrum on translinear scales 
is retrievable with a simple transformation on the den- 
sity field. A log transform could also prove useful for 
other statistics, such as estimators of primordial non- 
Gaussianity from large-scale structure. 

We thank Andrew Hamilton for useful discussions. 
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APPENDIX 

ANALYTIC ESTIMATES OF THE LOG-DENSITY POWER SPECTRUM BIAS 



The power sp ectrum Pa of the log-density A = log(l -I- 5) [for simplicity, A is twice the quantity defined by Szapudi 
& Kaiser (2003l] is generally biased on large scales relative to the standard Ps- We can calculate the moments of the 
A field with brute-force perturbation theory for small S, and express them in terms of the connected moments of S. 

Since A is not a zero-mean field, we start with the mean for completeness. Expanding the logarithm: 
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where <^s is the average of the correlation function within grid cells (i.e. the variance of cell densities) . The main 
subtlety of the calculation is to take into account the connectedness of the moments (e.g., Szapudi||2009 for detailed 

/^3\ is'^s'^') 

explanation). The skewness £"3 and the cumulants Cm,m (below) are defined by Sz = j^r^ ^^id Cm,m — (a s )\s^)''^+'^-^ ' 
The calculation of {A1A2) is analogous to the above. Expanding the logarithm, and expressing the results in terms 
of connected moments: 



1 + 6 (2 - Ci,2) + e 7 - 2^3 - 4Ci,2 



2C1.3 



C, 



2,2 



(A2) 



The DC term <^s/4 affects the two-point correlation function, but not the power spectrum. The large-scale power 
spectrum bias is the coefficient of the large-scale correlation function subtracting off contributions from S^s- 

For the non-perturbative calculation, we introduce D = A — {A). (6) = 0, so ^e^^^"^^) = 1. Using the connected 
moment theorem (Wilf 1994': 'Szalay"1988 : Szapudi & Szalay 1993) , and assuming D to be Gaussian, making all of its 
higher-than-second-order connected moments vanish, we obtain 

l + ^,^(e^ie^=) = e<^i^^> + (^'>. (A3) 

Using exp {D1D2) ~ 1 + {D1D2), and Fourier transforming, yields the non-perturbative shift of exp (D^) in Eq. (jlj). 



